General Frameworks for Combined Mining: Discovering Informative Knowledge in Complex Data
نویسندگان
چکیده
Enterprise data mining applications such as mining government service data often involve multiple large heterogeneous data sources, user preferences and business impact. Business people expect data mining deliverables to inform direct business decision-making actions. In such situations, a single method or one-step mining is often limited in discovering informative knowledge. It would also be very time and space consuming, if not impossible, to join relevant large data sources for mining patterns consisting of multiple aspects of information. It is crucial to develop effective approaches for mining patterns combining necessary information from multiple relevant business lines, catering for real business settings and delivering decision-making actions rather than providing a single line of patterns. The recent years have seen increasing efforts on mining such patterns, for example, integrating frequent pattern mining with classifications to generate frequent pattern-based classifiers. Rather than presenting a specific algorithm, this paper builds on our existing works and proposes combined mining as a general approach to mining for informative patterns combining components from either multiple datasets or multiple features, or by multiple methods on demand. We summarize general frameworks, paradigms and basic processes for multi-feature combined mining, multi-source combined mining and multi-method combined mining. Several novel types of combined patterns such as incremental cluster patterns result from such frameworks, which cannot be directly produced by existing methods. Several real-world case studies are briefed which identify combined patterns for informing government debt prevention and improving government service objectives. They show the flexibility and instantiation capability of combined mining in discovering more informative and actionable patterns in complex data. We also present combined patterns in dynamic charts, a novel pattern presentation method reflecting the evolution and impact change of a cluster of combined patterns and supporting business to take actions on the deliverables for intervention.
منابع مشابه
Combined Mining Approach to Generate Patterns for Complex Data
In Data mining applications, which often involve complex data like multiple heterogeneous data sources, user preferences, decision-making actions and business impacts etc., the complete useful information cannot be obtained by using single data mining method in the form of informative patterns as that would consume more time and space, if and only if it is possible to join large relevant data s...
متن کاملPattern Generation for Complex Data Using Hybrid Mining
Combined mining is a hybrid mining approach for mining informative patterns from single or multiple data-sources, multiple-features extraction and applying multiple-methods as per the requirements. Data mining applications often involve complex data like multiple heterogeneous data sources, different user preference and create decision-making actions. The complete useful information may not be ...
متن کاملCombined Mining: Analyzing Object and Pattern Relations for Discovering Actionable Complex Patterns
Combined mining is a technique for analyzing object relations and pattern relations, and for extracting and constructing actionable complex knowledge (patterns or exceptions) in complex situations. Although combined patterns can be built within a single method, such as combined sequential patterns by aggregating relevant frequent sequences, this knowledge is composed of multiple constituent com...
متن کاملAn Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network
RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...
متن کاملA review of text mining approaches and their function in discovering and extracting a topic
Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling. Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...
متن کامل